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Abstract. The call-by-need lambda calculus provides an equational framework for rea- 
soning syntactically about lazy evaluation. This paper examines its operational character- 
istics. 

By a series of reasoning steps, we systematically unpack the standard-order reduc- 
tion relation of the calculus and discover a novel abstract machine definition which, like 
the calculus, goes "under lambdas." We prove that machine evaluation is equivalent to 
standard-order evaluation. 

Unlike traditional abstract machines, delimited control plays a significant role in the 
machine's behavior. In particular, the machine replaces the manipulation of a heap using 
store-based effects with disciplined management of the evaluation stack using control-based 
effects. In short, state is replaced with control. 

To further articulate this observation, we present a simulation of call-by-need in a call- 
by-value language using delimited control operations. 



1. Introduction 



From early on, the con nections between lazy evaluation ( Friedman and Wise . 19761 : 
Henderson and Morrid . 1976) and control operat i ons s eemed strong. One of these seminal 
papers on lazy evaluation ([Henderson and Morrid . [197^ ) advocates laziness for its coroutine- 
like behavior. Specifically, it motivates lazy evaluation with a solution to the same fringe 
problem: how to determine if two trees share the same fringe without first flattening each 
tree and then comparing the resulting lists. A successful solution to the problem traverses 
just enough of the two trees to tell that they do not match. The same fringe problem 
is also addressed in Sussman and Steele's original exposition of the Scheme programming 
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language ( Sussman and Steele Jr. . 19751 ). One of their solutions uses a co ntinuation passing- 



style representation of coroutines. More recently, iBiernacki et al.l (j2005l ) explores a number 
of continuation-based solutions to same fringe variants. 

Same fringe is not the only programming problem that can be solved using either lazy 
evaluation or continuations. For instance, l a.zy streams and continu a tions are also used to 
imple ment and reason about backtracking ( Wand and Vaillancourt . 2004 : Kiselvov et al.l . 
2OO5I ). Strong parallels in the literature have long suggested that lazy evaluation elegantly 



embodies a stylized use of coroutines. Indeed, we formalize this connection. 

Call-by-need evaluation combines the equational reasoning capabilities of call-by-name 
with a more efficient implementation technology that systematically shares the results of 
some computations. However, call-by-need's evaluation strategy makes it difficult to reason 
about the operational behavior and space usage of programs. In particular, call- by-need 
eval uation obscures th e contro l flow of evaluation. To facil i tate reasoning, semantic mod - 
els (iLaunchburvl. Il993l: ISe stoft'. 'l997l: iFriedman eraP. l2007l: iNakata and Hase gawa'. '2009"), 
simulations (jOkasaki et al.. . ,1994i ) , and tracing tools (jGibbons and Wansbrough . ,199S ) for 
call-by-need evaluation ha ve been devel oped. Many of these artifacts use an explicit store 
or store-based side-effects ( Wangj . 199d ) to represent values that are shared between parts 
of a program. Stores, being amorphous structures, make it difficult to establish program 
properties or analyze program execution. This representation of program execution loses 
information about the control structure of evaluation. 

The call-by- need lambda calculus was introduced by Ariola et al. ( 19951 ) as an alter- 
native to store-based formalizations of lazy evaluation. It is an equational fram ework for 
reasoning about call-by-need programs and languages. Following Plotkin ( 19751 ). these au- 
thors present a calculus and prove a standardization theorem that links the calculus to a 
complete and deterministic (i.e. standard order) reduction strategy. The calculus can be 
used to formally justify transformations, particularly compiler optimizations, because any 

terms it proves equal are also contextually equivalent ui ider call-by-need evaluation. 

Call-by- need calculi were investigated by two groups ( Maraist et al. . 19981 : Ariola and Felleisen . 

1993). The resulting two calculi are quite similar but their subtle differences yield trade-offs 
that are discussed in the respective papers. Nonetheless, both papers connect their calculi 
to similar standard-order reduction relations. 

One notable feature of Ariola and Felleisen's calculus (and both standard-order reduc- 
tion relations) is the use of evaluation contexts within the notions of reduction. It has been 
observed that eva luation contexts correspond to continuations in some presen tations of lan- 
guage semantics ( Felleisen and Friedman . 1986 : Biernacka and Danvv . 20071 ). However, in 
these systems evaluation contexts are used to model variable references and demand-driven 
evaluation, not first-class continuations. 

This paper exposes how Ariola et al.'s call- by- need evaluation relates to continuations. 
By systematically unpacking the standard-order reduction relation of the calculus, we dis- 
cover a novel abstract machine that models call-by-need style laziness and sharing without 
using a store. Instead, the machine manipulates its evaluation context in a manner that 
corresponds to a stylized use of delimited control operations. The machine's behavior re- 
veals a connection between control operations and laziness that was present but hidden in 
the reduction semantics. 

To directly interpret this connection in the terminology of delimited coi itrol, we con 



struct a simulation of call-by-need terms in the call-by- value language of iDybvig et al. 
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(120071), which provides a general framework for delimited continuations with first-class gen- 
erative prompts. 

Our concrete specifications of the relationship between call-by-need and delimited con- 
trol firmly establish how lazy evaluation relates to continuations and other control-oriented 
language constructs and effects. Implementations of both the machine and the simulation 
are available at the following url: 
http : / / osl . iu . edu/~garcia/call-by-need . tgz. 



2. The Call-by-need Lambda Calculus 



T he remainder of this pape r examines Ariola and Felleisen's formalization of call-by- 
need ( Ariola and Felleisen . 19971 ). The terms of the calculus are standard: 



t 



X I \x.t I 1 1 



The call- by-need calculus, in direct correspondence with the call- by- value and call- by-name 
lambda calculi, distinguishes lambda abstractions as values: 



Xx.t 



Call-by-need is characterized by two fundamental properties: a computation is only per- 
formed when its value is needed, and the result of any computation is remembered and 
shared so that it only needs to be computed once. This calculus distinguishes two addi- 
tional subsets of the term language to help represent these properties. 

To capture the notion of necessary computations, the calculus distinguishes the set of 
lambda abstractions that immediately need the value of their argument. To define this set, 
the calculus appeals to a notion of evaluation contexts, a set of terms that each have a 
single hole (□) in them: 



E 



a \ Et \ {Xx.E[x]) E I (Xx.E) t 



Given the evaluation contexts, the set of lambda abstractions in question is defined syntac- 
tically as {Xx.E[x\), the set of lambda abstractions whose bodies can be decomposed into 
an evaluation context and a free instance of the abstracted variable. 

The intuition for this definition is as follows. Ev aluation contexts ar e essen tially terms 
with a single hole in them. When used in the style of lFelleisen and Hiebl (|l992l ^. evaluation 
contexts indicate those locations in a program that are subject to evaluation. As such, a 
lambda abstractions of the form {Xx.E[x\) will immediately refer to the value of its argument 
when its body is evaluated. Here, E[x\ is not a language construct, but rather metalinguistic 
notation for a side condition on the term in the body of the lambda abstraction. 

The syntactic structure of call-by-need evaluation contexts give some hint to the nature 
of computation captured by this calculus. First, the context production E t indicates that 
evaluation can focus on the operator position of an application regardless of the structure 
of the operand. This property is also true of call- by-name and call- by- value and reflected 
in their respective evaluation contexts. Second, the {Xx.E[x]) E production indicates the 
operand of an application expression can be evaluated only if the operator is a lambda 
abstraction that immediately needs its argument. This restriction does not hold for call-by- 
valu^, where any lambda abstraction in operator position justifies evaluating the operand. 



"'^assuming left-to-right evaluation of application expressions 
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This restriction on evaluation corresponds with our intuitive understanding of call-by-need. 
Third, the (Xx.E) t production indicates that evaluation can proceed under a lambda ab- 
straction when it is in operator position. Though not immediately obvious, this trait is 
used by the calculus to capture the process of sharing computations among subterms. 

To capture the notion of shared computations, the call- by- need calculus distinguishes 
lambda abstractions with explicit bindings for some variables, calling them answers: 



V I (Ax. a) t 



Answers are a syntactic representation of (partial) closures. An answer takes the form of a 
lambda term nested inside some applications. The surrounding applications simulate envi- 
ronment bindings for free variables in the nested lambda term. This representation makes 
it possible for the calculus to explicitly account for variable binding and to syntactically 
model how call-by-need evaluation shares lazily computed values. 
The calculus has three notions of reduction: 



{\x.E[x\) V 


^need 


{\x.E[v]) V 


(Ax. a) ti t2 


^need 


{Xx.a t2) ti 


(Axi.£'[xi]) ((Ax2.a) ti) 


^nccd 


(Ax2.(Axi.£'[xi]) a) ti 



The first reduction rule substitutes a value for a single variable instance in an abstraction. 
The rule retains the binding and abstraction so as to share its computation with other 
variable references as needed. The second and third reduction rules commute an application 
with an answer binding to expose opportunities for reduction without duplicating not-yet- 
needed computations. These two rules help to ensure that computations will be shared 
among references to a common variable. 



As popularized by iBarendreg 



(Il98ll ). each reduction assumes a hygiene convention. 
When combined with the evaluation contexts, the notions of reduction yield a deterministic 
standard order reduction relation (i — >sr) and its reflexive-transitive closure ( i — ^sr)- 



Definition 2.1. ti 



t2 if and only if ti = E[tr], t2 = E[tc\ and tr ^need 



Terms of the calculus satisfy unique decomposition, meaning that any program (i.e. 
closed term) that is not an answer can be decomposed exactly one way into a context E 
and redex tr- This property ensures that i — >sr is deterministic. Standard order reduction 
is an effective specification of call-by-need evaluation: if t is a program (i.e. closed term), 
then t call-by-need evaluates to an answer if and only if t i — »sr cl for some answer a. 



3. From Reduction Semantics to Machine Semantics 



Some reduction semantics have been shown to correspond directly to abstract machine 
semantics, thereby establ ishing the equivalence of a reducer and a tai l -recur sive abstract 
ma chine implementatio n (iFelleisen and Friedmanl . \WH(i : IFelleisen et ID . 120091 ) . In particu- 
lar, Danvy and Nielsen ( 20041 ) introduces a method and outlines criteria for mechanically 
transforming reduction semantics into abstract machine semantics. However, proceeding 
directly from the reduction semantics for call-by-need to a tail-recursive abstract machine 
semantics poses some challenges that do not arise with other reduction semantics like call- 
by-name and call- by- value. 

A straightforward call-by-need reducer implementation naively decomposes a term into 
a context and a redex. Any application could be one of three different redexes, each of 
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which is nontrivial to detect, so whenever the decompose function detects an apphcation, it 
sequentiahy apphes several recursive predicates to the term in hopes of detecting a redex. 
If the term is a redex, it returns; if not, it recursively decomposes the operator position. 

Call-by-need redexes require more computational effort to recognize than either call- 
by-name or call-by-value. For instance, given a term t, only a fixed number of terminal 
operations are required to detect whether t is a call-by-name redex: one to check if the 
term is an application, one to access the operator position, and one to check if the operator 
is an abstraction. 

Contrast this with the call-by-need redex {Xx.E[x]){{Xy.a)t). Given a call-by-need term 
tx, testing whether it matches this redex form requires an unknown number of operations: 
check if tx is an application; check if its operator position is a lambda abstraction; check, in 
an unknown number of steps, if the operator's body can be decomposed into E[x], where x 
is both free in E and bound by the operator; and finally check, in an unknown number of 
steps, if the operand has the inductive structure of an answer. 

To make matters worse, some terms can be decomposed into the form E[x] in more 
than one way. For instance, consider the term (Ax.(Ay.y) x). It can be decomposed as both 
{Xx.Ei[y]) and {Xx.E2[x]) where Ei = (Ay.D) x and E2 = {Xy.y) □. As such, a recursive 
procedure for decomposing a term cannot stop at the first variable it finds: it must be able 
to backtrack in a way that guarantees it will find the right decomposition E[x] — and in turn 
the right redex — if there is one. 

Recall that one of the evaluation contexts has the form (Xx.E)t. This means that redex 
evaluation can occur "under binders" ( Moggi and Sabry . 2004 : Kameyama et al. . 20081 ). All 
three call-by-need notions of reduction shuffle lambda abstractions about in unusual ways. 
Furthermore, while reducing a recursive routine, a call-by-need evaluator may end up per- 
forming reductions under multiple copies of the same lambda abstraction. Call-by-name and 
call-by- value evaluators can address hygiene concerns by using environments and closures, 
but a call-by-need evaluator must prevent its evaluation context from incorrectly captur- 
ing free variable referenc es. A ny evaluator that goes under lambdas must pay particular 
attention to hygiene 



Since this work was originally publi s hed, 



Danvv et al. ( 2010l ) have adapted and ex- 



tended the method of lDanvv and NielsenI (j2004l ) to produce a related abstract machine for 
call-by-need. 



3.1. Towards an Abstract Machine. To fi nd call-by-need redexes tail-recursively, we ap- 
ply a n insight from the CK abstract machine ( Felleisen and Friedmanl . 1986 : Felleisen et al. 



2OO9I ). The CK machine implements an evaluation strategy for call- by- value based on a re- 
duction semantics using the (inside-out) evaluation contexts □,i?[n t] and E[{Xx.t) □]. To 
find a redex, the machine iteratively examines the outermost constructor of a term and uses 
the evaluation context to remember what has been discovered. Since call-by-value satisfies 
unique decomposition, this approach will find the standard redex if there is one. 

To illustrate this in action, we walk through an example. Consider the program 
(Xx.x) Xy.y. Evaluation begins with configuration {[],(Xx.x) Xy.y). Underlining indicates 
subterms that the machine knows nothing about; at the beginning of evaluation, it knows 
nothing about the entire term. On the first step of the reduction, the machine detects that 
the term is an application (Xx.x) Xy.y. To examine the term further, the machine must 
move its focus to either the operator or operand of this application. Since the machine is tail 
recursive, it must also push an evaluation context to store the as-yet uncovered structure 
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of the term. The only context it can rehably push at this point is [□ Xy.y]'- it cannot push 
[{Xx.x) □] because it has not yet discovered that the operator position is a value. So the 
machine pushes the [□ Ay.y] context, which serves as a reminder that it is focused on the 
operator of an application. 

On the second step of the reduction, the machine detects that the operator is an abstrac- 
tion Xx.x, and observes that the innermost context is [□ \y.y\. In response, the machine 
pops the context, focuses on Ay.y, and pushes the context [{Xx.x) □], since the operator is 
now known to be a value. This context serves as a reminder that evaluation is currently 
focused on the operand of an application that can be reduced once that operand becomes 
a value. 

On the third step, the machine detects the abstraction (Xy.y), and remembers that the 
innermost context is [{Xx.x) □]. At this point, the machine has deduced enough information 
to recognize the redex {Xx.x) Xy.y. This example illustrates how the CK machine uses a 
depth-first left-to-right search strategy to detect call- by- value redexes. 

Now consider the same term under call-by-need using a similar strategy. As with call- 
by-value, the top-level application can be detected, the operand can be pushed onto the 
evaluation context, and the operator can be exposed as the abstraction Xx.x. At this 
point behavior must diverge from call-by-value because the body of the abstraction is still 
unknown and call-by-need does not have [{Xx.t) □] contexts for arbitrary t. However, call- 
by-need does have contexts of the form [(Ax.D) Xy.y]. Therefore, it is possible to proceed 
under the first lambda abstraction, push the context, and focus on x. 

The term is exposed as a variable x, which combines with the context [(Ax.D) Xy.y] 
to form the term {Xx.E[x]) Xy.y (where E = □). At this point, enough information has 
been uncovered to push the context [(Ax.n[x]) □] and focus on Xy.y. The abstraction Xy.y 
is recognized, and with that a call-by-need redex (Ax.n[x]) Xy.y has been found. Success 
with this example suggests a promising strategy for implementing call-by-need reduction 
tail-recursively. 



3.2. An Initial Abstract Machine. In this section, we elaborate the above search strat- 
egy into a simple but inefficient tail-recursive abstract machine. We present it without proof 
and then by a series of correct transformations we derive an efficient machine that we prove 
correct. 

This abstract machine uses the same terms, values, and answers as the calculus. How- 
ever, it introduces two alternate notions. First, t he machine uses a rnore v ersatile rep- 
resentation of evaluation contexts. As observed in iDanvv and NielsenI (|2004l '). evaluation 
contexts can be mathematically specified in more than one way. For optimal fiexibility, we 
define evaluation contexts as lists of frames, where the empty list [ ] and single- frame lists 
[/] are our simple units, and the operator o stands for list concatenation. 



/ ::= at\ {kx.E) □ I {Xx.a) t 

E ::= []\[f]oE\Eo[f] 

where Eo[] = []oE = E 

and El o {E2 o £^3) = [Ei o E2) o E3 



When two contexts are composed, the second context is plugged into the hole of the first 
context: for example [□ t2] o [□ ti] = [(□ ti) ^2]- 
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We call the frame [(Ax.D) t] a binder frame. It represents a variable binding in the 
context. It can be read as [let x = t in □], but we use the former notation to emphasize that 
call-by-need evaluation proceeds under lambdas. This observation motivates our analysis 
of hygiene in Section 14.51 

We call the frame [{kx.E) □] a cont frame, in reference to continuations. The construc- 
tion (kx.E) is called a cont and replaces the metalinguistic term notation {Xx.E[x]) from 
the calculus. We use a different notation for conts than lambda terms to indicate that in 
the machine conts are distinct from terms (they are of type Cont rather than type Term 
in an implementation). Conts indicate nontrivial structural knowledge that the machine 
retains as it searches for a redex. This distinction matters when we establish continuation 
semantics for machine states. As we shall see, a cont frame represents a suspended variable 
reference. 

Finally we call the frame [□ t] an operand frame, and it represents a term waiting for 
an abstraction. 

The abstract machine also introduces a notion of redexes: 



a t I [kx.E) a 



Redexes are distinguished from terms in the machine, meaning that in an implementation, 
the type Redex is distinct from the type Term. This distinction suggests that conts (kx.E) 
are neither terms nor first-class entities in the call-by-need language: they only appear in 
evaluation contexts and in redexes. As we discuss below, the machine distinguishes one 
more kind of redex than the calculus. 

The transition rules for the machine are staged into four distinct groups: refocus, 
rebuild, need, and reduce. Each machine configuration can be related to a term in the 
language of the calculus. The refocus rules examine the current term and push as many 
operand frames [□ t] as possible. A refocus configuration {E,t)j represents the term E[t]. 




Upon reaching a variable, refocus transitions to the need rules; upon reaching a lambda 
abstraction, it transitions to the rebuild rules. 

The rebuild rules search up into the context surrounding an answer for the next appli- 
cable redex. A rebuild configuration {E,a)^ represents the term E[a]. 



{E,a)^ (Rebuild) 



([],«) 



I— 7> a 



{Eo[ah],a)^ ^ {E,ati)^ 
{Eo[{Xxn)h],a)^ ^ {E,{Xx.a)ti)f^ 
{Eio[{KX.E2)a],a)f^ ^ {Ei,{Kx.E2)a)a 



These rules examine the current context and proceed to build a maximal answer-shaped 
term, progressively wrapping binder frames around the current answer. If the entire context 
is consumed then evaluation has completed and the entire program is an answer. Upon 



8 



R. GARCIA, A. LUMSDAINE, AND A. SABRY 



reaching an operand or cont frame, a redex has been fou nd, and rebuild transitions to the 
reduce rules. These rules resemble the refocusaux rules of Danvv and Nielsen ( 20041 ). 

The need rules also examine the context, but they search for the binder frame that 
corresponds to the variable under focus. A need configuration {Ei, E2, x)^ represents the 
term £'i[£'2[x]]. 



{E,E, 




(Need) 


{Eio 


[(Ax 


a)t],E2,x)^ ^ {Eio[{Kx.E2)a],t)f 




{El 


o[f],E2,x)^ ^ {E,,[f]oE2,x)^ 






where, [/] ^ [(Ax.D) t] 



Since input programs are closed, the associated binder must be somewhere in the con- 
text. Upon finding the right binder frame, a cont frame [{kx.E) □] is pushed onto the 
context and evaluation proceeds to refocus on the operand from the associated binder 
frame. 

The reduce rules simulate the notions of reduction from the calculus. A reduce config- 
uration {E,r)^ represents the term E[r] where a cont kx.E represents the term Xx.E[x]. 





(Reduce) 






{Ei,{kx.E2) v)a ^ 


{Ei,{Xx.E2[v]) v)f 


{El, {KX1.E2) {{\x2.a) t))^ ^ 


{Ei,{Xx2.{Xxi.E2[xi]) a) t) ^ 




{E,{Xx.a) tit2)^ ^ 


{E, {Xx.at2) 




{E,{Xx.ti)t2)^ ^ 


{Eo[{Xxn) t2],ti)j 



Each of the first two reduce rules transforms a cont into a lambda abstraction by 
plugging its context with a term and abstracting its variable. As such, each reduce rule 
transforms a redex into a pure term of the calculus and transitions to a refocus configuration, 
which searches for the next redex. 

The reduce rules also handle terms of the form (Xx.ti) t2, even though such terms are 
not call-by-need redexes. Including this rule gives the set of redexes greater uniformity: all 
terms of the form a t are redexes, just like the terms of the form (kx.E) a. This sy mmetry 
is not exhibited in the call-by-need calculus. However, Ariola and FelleisenI (jl997l ) defines 
and uses an auxiliary let calculus that adds the reduction 

(Xx.ti) t2 — >need let X = t2 in ti 

to the calculus and defines the other reductions in terms of the let expressions. The fourth 
reduce rule corresponds to this reduction rule. However, our presentation shows that an 
auxiliary let term, though compatible with this model, is not needed to specify call-by-need: 
the syntax of pure calculus terms suffices. Furthermore, treating this rule as a reduction 
here anticipates a change we make to the machine (Section 14. 5p that adds explicit variable 
renaming to that rule. Finally, the reduce rules are improved in the next section so that all 
reduce rules change their representative terms nontrivially. 

Machine evaluation of a program t begins with the refocus configuration {[],t)j and 
terminates if it arrives at an answer a. Its behavior in between can be summarized as follows: 
search downwards until a value or variable reference is reached. If a variable reference is 
reached, store a cont in the context to remember the variable reference and proceed to 
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evaluate its binding. If an abstraction is reached, accumulate an answer up to the innermost 
redex, or the top of the evaluation context if none is found. In short, the machine performs 
a depth-first, left-to-right traversal in search of a call- by- need redex. Along the way it 
uses the evaluation context to store and retrieve information about program structure, 
particularly the location of variable bindings (using binder frames) and variable references 
(using cont frames). The refocus, rebuild, and need rules leave the term representation of 
their configurations unchanged (e.g. if {Ei,ti)j i-^ (-£^21^2)^ then Ei[ti] = E2[t2]), and the 
reduce rules embody the notions of reduction from the calculus. 



Our strategy for producing this machine builds on the strategy of iDanvy and Nielsen 



which mechanizes the direct transformation of reduction semantics into abstract ma- 
chine semantics. That report introduces and verifies a general method for using reduction 
semantics that meet certain criteria to construct a function that "refocuses" an arbitrary 
term-context pair to a redex-context pair. The resulting function can then be used to induce 
an abstract machine semantics. Unfortunately that refocus function construction does not 
apply to the call-by-need lambda calculus because the calculus does not meet the required 
criteria. In particular, the construction requires that a maximally-decomposed closed term 
(i.e. program) will focus on a value or a redex. However, call-by-need evaluation contexts 
can decompose down to variable references, which are neither redexes nor values under 
call-by-need. There are however other ways to produce an abstract machine from a reduc- 
tion semanti c s whi ch may apply to the call-by-need calculus studied here. For instance, 
Danvv et all (|2010l l devise a variant of the let-based call- by-need reduction semantics, im- 



plement it, and use a program-transformation based approach to produce a refocus function 
and abstract machine implementation. 

The following partial trace demonstrates how the initial abstract machine discovers the 
first redex for our running example (Xx.x) Xy.y: 

{[ ], (Xx.x) Xy.y ) ^ ^ ([□ Xy.y ], Xx.x ) j. ^ {[D Xyjj], Xx.x)^ 

^ ([],(Ax.x) Xy.y)^ ^ {[{Xx.n) Xy£,x) ^ ^ ([(Ax.D) Xy.y],[],x)^ 

^ ([{Kx.[])n],Xy.y)^ ^ {[iKx.[])D],Xy.y)^ ^ {[], {kx.[]) Xy.y)^ 



4. Refining the Machine 

In this section we study the behavior of the abstract machine and make some improve- 
ments based on our observations. These changes lead us from the initial machine above to 
our final machine specification. 

4.1. Grabbing and Pushing Conts. The need rules linearly search the nearest context 
for a binder frame that matches the variable under question. This process can be specified 
as one step: 

{El o [{Xx.a) t] o^2,x)„ ^ {El o [{KX.E2) a\,t)f 

where [(Ax.D) t] ^ E2 

This evaluation step accumulates a segment of the current evaluation context and stores it. 
In general, abstract machines that model control operators represent control capture in a 
similar manner. In this particular case, only part of the evaluation context is captured, and 
the amount of context captured depends on the dynamic location in the context of a certain 
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frame. As such, the need rules seem to perform some kind of delimited control capture. This 
analogy becomes stronger upon further analysis of the first reduce rule from Section 13.21 
The machine uses its structural knowledge of nx.E to construct the abstraction Xx.E[v]. 
However, the resulting machine configuration no longer retains any of the structure that 
had previously been discovered. Recall our example execution trace from Section [3. 2[ The 
machine reduces the redex found at the end of that trace as follows: 

([ ], (kx.[ ]) Xy.y)^ ^ {[ ], {Xx.Xy.y) Xy.y ) ^ 

By returning to refocus following the reduction, the machine loses all structural knowledge 
of the term. To continue execution, it must examine the structure of the contractum from 
scratch. Fortunately, the evaluator can be safely improved so that it retains knowledge of 
the contractum's structure: 

Proposition 4.1. 

(^1, {Xx.E2[v]) v)f ^ {El o [{Xx.a) v] o E2,v)^ 

Proof. Corollary of {Ei, E2[v]) j i — » {Ei o E2,v)f^, which is proven by induction on E2. □ 

This proposition justifies replacing the first reduce rule with one that pushes the eval- 
uation context embedded in the cont and proceeds to rebuild an answer: 

{El, {KX.E2) v)a ^ {El o [{Xx.a) v] o E2,v)f, 

This short-circuit rule extends the current evaluation context with a binder frame and the 
context E2 that was inside the cont. The rule is suggestive of delimited control because 
machine models of control operators generally represent the reinstatement of delimited 
continuations by extending the current context with a piece of captured evaluation context. 
Of more immediate interest, though, is how reduction of our example now proceeds: 

{[],{KX.[])Xy.y)^^{[{Xx.D) {Xy.y)], Xy.y)^ 

All knowledge of the contractum's structure is retained, though much of it is now stored in 
the evaluation context. 

4.2. Shifting Binder Frames. The second and third reduce rules from Section 13.21 also 
discard structural information. Specifically, they both transition to the forgetful refocus 
rule. However their information can be preserved. 

Proposition 4.2. 

{E,{Xx.at2) ti)f ^{Eo[{Xx.D) ti],at2)rf. 
Proof. Corollary of {Ei,a) ^ 1 — » (£'i,a)^, which is proven by induction on a. □ 

Proposition 4.3. If E2 does not capture xi (Section \4.5\ ), then 

{Ei,{Xx2.{Xxi.E2[xi]) a) t) ^ ^ {Ei o [(Axa-D) t],{KXi.E2) a)^. 
Proof. Corollary of {Ei,a)j 1 — » {Ei,a)^ and 

{Ei,{Xxi.E2[xi\) t)j I — » {El o [{k,xi.E2) D],t)j, which is proven by case analysis and in- 
duction on E'2. □ 
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These propositions justify short-circuiting the respective evaluation rules. The new 
rules improve the behavior of the abstract machine. 

(Si, {KX1.E2) {{Xx2.a) t))^ ^ {El o [{\x2.a) t], {KX1.E2) a)^ 

{E, (Ax.a) h t2)d ^ {E o [(Ax.D) h],a ta)^ 

By fast-forwarding to reduce, the rules retain the discovered term structure and thereby 
avoid retracing the same terms again. 

4.3. Answer = Binders x Value. The transition rules repeatedly create binder frames 
out of terms and reabsorb those frames into answers. In this section we simplify this 
protocol. We distinguish answers from terms by providing them a separate representation: 



lE,v^, where E = [(Axj.D) t. 



An answer is now represented as a tuple containing the nested lambda abstraction and 
the sequence of binder frames that are wrapped around it in the old presentation (we use 
overlines to indicate s equences). This pr esentation bears strong similarity to calculi with 



explicit substitutions (lAbadi et al.l . Il99ll ) in that each binder frame [(Ax.D) t] corresponds 



to a substitution [t/x]. An answer can be seen as a lambda term nested inside a sequence 
of explicit substitutions, v[ti/xi]. 

The rebuild rules could be reformulated as a three place configuration, {E,E,v)f^, but 
instead we immediately apply the same improvement that we applied to the need rules in 
Section [4.11 For instance, the new transition rule for rebuilding to a cont frame is: 

{El o [{KX.E2) □] o E3, v)^ ^ {El, {KX.E2) 1^3, ^Drf 



where £'3 = [(Axj.D) U] 

Returning to our running example, reduction from its most recent state (at the end of 
Section 14. ip transitions to a final answer, signaling the end of execution: 

([(Ax.D) Ay.y],Ay.y>. ^ ([[[(Ax.D) \y.y\,\y.y}) 



4.4. Aggregate Reduction. Now that answers explicitly carry their binders in aggregate, 
the reduce rules can be substantially consolidated. Currently, the second and third reduce 
rules iteratively remove the top binder frame from an answer and push it onto the evaluation 
context. This process repeats until the answer is just a lambda abstraction. At that point, 
the second and third reduce rules defer to the first and fourth reduce rules respectively. 
This corresponds exactly with standard-order reduction (cf. Definition 12. ip : 

Proposition 4.4. 

E[{{\Xn. ■ ■ ■ {{\XI.{{\XQ.V) to)) h) ...) tn) t] 
E[{{XXn. ■ ■ ■ {{\XI.{{\XQ.V t) to)) ti) ...) tn)]. 

E[{Xx.E[x]) {{XXn- . . . {{Xxi.iiXxo.v) to)) ti) ...) tn)] ^sr 

E[{{Xxn. ■ ■ ■ {{Xxi.{{Xxo.{Xx.E[x]) v) to)) ti) ...) tn)]. 



Proof. B y induction on the st r uctur e of the answer term, using the unique decomposition 
lemma of Ariola and Felleisen ( 19971 ). □ 
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Using the new answer representation, each pair of associated reduce rules can be merged 
into one omnibus rule that moves all the binder frames at once and simultaneously performs 
a reduction using the underlying value. 

(^1, {nx.E2) p3,t'|)rf ^ (^1 ° ^3 o [(Ax.D) 1,] o E2,v)j^ 

{El, IE2, {Xx.h)W t2)d ^ {El o E2 o [{Xx.D) t2],ti)f 

As a result of these transformations, both conts and answers contain evaluation con- 
texts. Furthermore, conts and answers are not terms of the calculus, and the machine never 
reverts a cont or answer to a term. The rules that create them, rebuild for answers and 
need for conts, capture part of the evaluation context, and the rules that consume them, 
the reduce rules, reinstate the captured contexts. 

4.5. Variable Hygiene. Presentations of calculi often invoke a hygiene convention and 

from then on pay little attention to bound or free variables. In this manner, calculi do 
not commit to any of the numerous ways that hygiene can be enforced. Many abstract 
machines, however, use environments or explicit sources of fresh names to guarantee hygiene 
and thereby provide a closer correspondence to concrete implementations. In this section, 
we augment the call-by-need machine with simple measures to enforce hygiene. 

Our primary hygiene concerns are that evaluation occurs under binders and binders 
are shifted about in unusual ways. In order to ensure that binding structure is preserved 
throughout evaluation, we need to be able to reason locally, within each machine config- 
uration, about bound variables. To make this possible, we make one last change to the 
machine. We add a list of names to each machine configuration. 

X ::= xi 

Most of the machine rules simply pass the list of names along to the next transition. One 
of the reduce rules manipulates the list of names. 

{X\Ei,lE2,\x.h]\t2)d^nam {X , x' \ Ei O E2 o [{Xx' H) t2], ti[x' / x]) ^ x' ^ X (D.2) 

When this rule goes under a lambda, it adds the name of its bound variable to the list X 
of variables. The notation X, x expresses adding a new name x to X. If the bound variable 
X on the left hand side of the rule is already a member of X, then the variable is renamed 
as part of the transition. As such, X can be considered a set. 
Now each machine configuration has one of five forms: 

{X\E,?) ■■= {X\lE,v]\)\{X\E,r)a\{X\E,t)f 
I {X\E,v),\{X\E,x)^ 

We use the notation {X \ E, ?) below to uniformly discuss all configuration types, where 
X refers to the list of names, E refers to the context, and ? refers to the term or redex. 
For a final configuration {X \ [[i?, ?;|), ? refers to the answer's underlying value v, and E 
corresponds to the answer's binder frames E. We use the metavariable C to range over 
configurations when the internal structure does not matter. 
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The call-by-need abstract machine uses the set X of names to keep track of active 
variables: any variable x whose binding instance has been pushed into a binder frame 



Cont-bound variables are counted among the active variables because machine evaluation 
must have gone under a binding to construct the cont frame. 

The renaming condition on the {D.2) reduce rule ensures that active variables are 
mutually distinguishable. This guarantees that the machine's need rule can never capture 
the wrong evaluation context and thus execute the wrong bound expression. 

Renaming is not obviously sufficient to ensure bound variable hygiene because of how 
the machine manipulates evaluation contexts. For instance, even though the need rule is 
guaranteed to only match a variable with the right binder frame, we have no guarantee 
that the right binder frame could never be trapped inside a cont frame and hidden from 
view while a need transition searches for it. Were this to happen, the machine would get 
stuck. Furthermore, the reduction rules flip and shift evaluation contexts that might contain 
binder frames. If a binder frame were to end up below another context frame that contains 
references to its bound variable, then those references would no longer be bound in the 
context; the need rule would exhaust the evaluation context if it attempted to resolve any 
of these references. 

To verify that machine evaluation is well-formed, we establish well-formedness condi- 
tions that suffice to ensure safe evaluation and we show that they hold for evaluation of 
all programs. The well-formedness conditions rely on straightforward notions of captured 
context variables {CV) and free context variables {FV): 



[{Xx.U) t\: 



AV{[]) 

AV{[{\xn) t\oE) 
AV{[n t] o E) 
AV{[{kx.Ei) □] oi 



= 

= { .T } U AV{E) 
= AV{E) 



E) = AV{Ei)u{x}uAV{E) 



CV{[]) 

CV{E o [{Xx.U) t\) 
CV{Eo[Ut]) 
CV{Eo[{kx.Ei) □]) 





{x}v^cv{E) 

CV{E) 
CV{E) 



FV{[]) = 

FV([{\xn) t]oE) = FV{t) U {FV{E) - { x }) 

FV{[nt]oE) = FV{E)UFV{t) 

FV{[{KX.Ei)n]oE) = FV{E)U{FV{Ei) - {x}) 



As expected, these two notions are related. 



Lemma 4.5. 



(1) FV{Ei) C FV{EioE2). 

(2) CvIei o E2) = CV{Ei) U CViE2). 

(3) FV{Ei o E2) = FvIEi) U {FV{E2)\CV{Ei)). 



Proof. By induction on the length of Ei, E2, and Ei respectively. 



□ 
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To establish that binder frames remain properly positioned, we define a notion of well- 
formed evaluation contexts: 

Definition 4.6. A Machine context/name pair is well formed, notation X \ E wf, iff: 

(1) FV{E) = 0; 

(2) AV{E) = {x\x occurs in X } 

(3) Each active variable of E is distinct: ii EioE^^ E then AV{Ei) r\AV{E2) = $, and 
if [{kx.Ei) U]eE then x ^ ^^(£^1). 

These well-formedness criteria ensure that a context has no unbound variable references, 
that active variables cannot interfere with each other, and that X is simply a particular 
ordering of the E^s active variables. The captured variables of E are also distinct since 
every captured variable is an active variable. 

Furthermore, machine configurations also have a notion of well-formedness: 



X I £; wf FV{1) C CV{E) 

{X\E,1)^ wf '^'^ 

X\{Eio [{KX.E2) □] o E3) wf FV{v) C CV{Ei o [{KX.E2) □] o E^) 
{X\E^,{kx.E2) lE^,v\^)^ wf 

X I (El o [□ t] o E2) wf FV{v) C CV{Ei o [□ t] o E2) 
{X\E^,lE2,v\\t2)a wf 



These well-formedness conditions ensure that the evaluation contexts E are well-formed, 
that the list of names X matches the active variables of E, and that the free variables 
of the term under focus are captured by the context. To account for redexes, the well- 
formedness conditions for each reduce configuration reflect the well-formedness conditions 
for the corresponding rebuild configuration. As shown below, well-formed reduce config- 
urations {X I E, r)^ wf ensure that reduce rules can be safely performed without implicit 
renaming. 

Well-formedness of the reduce configurations ensures that the reduce transitions require 
no implicit bound-variable renaming to preserve hygiene. Well-formedness of the need 
configuration guarantees that it cannot be "stuck": since x G CV{E), a well-formed need 
configuration always has a binder frame [(Ax.D) t\ that matches the variable under focus, 
so the configuration can transition. 

Well-formedness of configurations combined with rule D.2's name management ensures 
that machine evaluation respects variable binding structure. 

Theorem 4.7. Ift is a closed term of the calculus, then (0 | [],t) j wf. 

Proof I [ ] wf and FV{t) C CVi[ ]) = 0. □ 

Theorem 4.8. Let Ci and C2 be configurations. If Ci wf and C\ 1 — ^nam C2 then C2 wf. 

Proof. By cases on 1 — >nam ■ The cases are immediate except when Ci 1 — >nam C2 by a 
reduce rule. The proofs for both kinds of reduce configurations are similar, so we present 
only one of the cases: 

Case {Ci = {X\Ei, {KX.E2) [[^3, vf]^). 
By definition, C\ 1 — ^nam 

C2 = {X\ E, v)^, where E = o o [(Ax.D) v] o E2. Since the 



LAZY EVALUATION AND DELIMITED CONTROL 



15 



transition rule introduces no new active variables, X should remain the same. Furthermore, 
all the active variables remain distinct, though x is now introduced by the binder frame 
[(Ax.D) v] rather than the cont frame [{KX.E2) □]. It remains to show that FV{v) C CV{E) 
and that FV{E) = 0. 

First, since Ci wf, it follows by inversion that X \ (Ei o [(KX.E2) □] o £^3) wf and 

FV{v) C CV{Ei o [{KX.E2) □] o Es) = CV{Ei o E^) C CV{E). 

By LemmaSSl FV{Ei) = FV{Ei o [{KX.E2) □]) = FV{Ei o [{KX.E2) □] o E3) = 0. 

Since Cy([(Kx.^2) □]) = 0, it follows from Lemma that FViE^) C CViEi) and 
from the definition of FV that FV{E2) Q CV{Ei) U {x}. From these it follows that 
FV{E) = 0. 

□ 

In short, well-formedness of the reduce configurations ensures that the reduce rules 
can be safely performed without any implicit renaming. Since the machine preserves well- 
formedness, this property persists throughout evaluation. The rest of this paper only con- 
siders well- formed configurations. 

4.6. An Abstract Machine for Call-by-need. Putting together our observations from 
the previous section, we now present the specification of the abstract machine. Figure [J 
presents its 1 — >nam transitions rules. We have derived a heap-less abstract machine for 
call-by-need evaluation. It replaces the traditional manipulation of a heap using store-based 
effects with disciplined management of the evaluation stack using control-based effects. In 
short, state is replaced with control. 

Machine evaluation of a program t begins with (0 | [],t)j and terminates at {X \ ^E, u JJ). 

5. Correctness of the Machine 

The previous section proves that the machine manipulates terms in a manner that 
preserves variable binding. In this section, we prove that those manipulations correspond 
to standard-order call-by-need evaluation. 

To proceed, we first establish correspondences between abstract machine configura- 
tions and call-by-need terms. As we have alluded to previously, abstract machine contexts 
correspond directly to calculus contexts: 



CM 


= □ 


cm t]oEj 


= cmt 


ci[{kx.Ei) a]oE2j 


= iXx.ClE^}[x]) ClE2j 


cliiXxn) t]oEj 


= iXx.ClE})t 



Redexes also map to call-by- need terms: 



CllE,v^ tj 


= icmy])t 


CUkx.E,) lE2,vM 


= {Xx.ClE,j[x]) {ClE2j[v]) 



Given that terms map identically to terms, configuration mapping is defined uniformly: 

ci{x\E,?)} = cmim] 
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{X I r)^ (Reduce 



{D.l) {X\E,,{KX.E2)lE^M)d 
[D.2) {X\E^,lE2,XxM\^t2)^ 



{X\EioEzo[{Xxn) v]oE2,v)^ 

(X, x'\EioE20 [{Xx'.D) t2],ti[x'/x])j. x' iX 



{X \E,t)f (Refocus) 



(Rl) {X\E,x)f 
{F.2) {X\E,\x.t)^ 
(F.S) {X\E,ht2)f 



{X\E,x)^ 
{X\E,\x.t)^ 
{X\Eo[U t2],ti)f 



{X\E,v)^, (Rebuild) 



(B.l) {X\E,,v), 
iB.2) {X\Eio[Dt]oEb,v)^ 
(B.3) {X\Eio[{Kx.E2)n]oEb,v)^ 
where Ei, — [{Xxi.D) ti] 



{X\lE,,v^) 
{X\E,,lEi„v^ i), 
{X\Ei,{kx.E2) lEb,v^)„ 



{X\E,x)^^ (Need) 



(A^.l) {X\Eio[{Xxn)t]oE2,x)^ 
where [(Ax.D) t] i E2 



l,X\E^o\{Kx.E2)U\,t). 



Figure 1: Call- by- need Machine 



Since the calculus is defined over alpha equivalence classes, we reason up to alpha equivalence 
when relating terms to machine configurations. 

We now state our fundamental correctness theorems. First we guarantee soundness, 
the property that every step of the abstract machine respects standard-order reduction. 

Theorem 5.1. If t\ = C[[Ci]] and Ci 1 — >nam C2, then ti 1 — »sr t2, for some t2 = C\C2\- 

Proof. By cases on 1 — >nam ■ Only rules D.l and D.2 are not immediate. The other rules 
preserve equality under C[C|. □ 

Corollary 1 (Soundness). 

Ift = CIC\ and C ^nam {X 1 |[-B,uJJ), then t a, for some a = Cl{X \ lE,v^)j. 

Proof. By induction on the length of the 1 — »nam sequence. □ 

We also prove completeness, namely that abstract machine reduction subsumes stan- 
dard order reduction. 

Theorem 5.2 (Completeness). 

Ift = ClCj and t a, then C ^nam {X \ lE,vW), with a = Cl{X \ lE,v^)j. 

Proof. This proof proceeds by induction on the length of 1 — »sr sequences. It utilizes 
Proposition 14.41 to accelerate the 1 — >sr rules in accordance with 1 — >nam- It also relies on a 
number of lemmas to establish that 1 — >nam will find the unique redex of a term from any 
decomposition of a term into a context E and a subterm t. □ 



LAZY EVALUATION AND DELIMITED CONTROL 



17 



Theorem 5.3 (Correctness). Ift = ClC}, then 

t I— a if and only if C i-^nam {X \ |[^, t;JJ) 
witha = Cl{X\lE,v^)i. 



5.1. Discussion. This abstract machine has nice inductive properties. The refocus rules 
always dispatch on the outermost term constructor. The rebuild and need rules dispatch 
on a prefix of the context, though each has difi^erent criteria for bounding the prefix. 

The abstract machine's evaluation steps should not be seen as merely a desperate search 
for a redex. Rather, the machine exposes the fine- grain structure of call-by-need evaluation, 
just as the CK machine and the Krivine machine ( Krivinel . 200?! ) model evaluation for call- 
by-value and call-by-name respectively. Answers are the partial results of computations, and 
the rebuild rules represent the process of reconstructing and returning a result to a reduction 
site. Furthermore, the need rules can be viewed as a novel form of variable-lookup combined 
with lazy evaluation. The evaluation context captures the rest of computation, but not in 
order: variable references cause evaluation to skip around in a manner that is difficult to 
predict. 

The way that variables behave in these semantics reveals a connection to coroutines. 
The reduction rule D.2 binds a variable to a delayed computation; referencing that variable 
suspends the current computation and jumps to its associated delayed computation. Upon 
completion of that computation, any newly delayed computations (i.e. binder frames) are 
added to the evaluation context and the original computation is resumed. 

The standard-order reduction relation of the call-by-need lambda calculus defines an 
evaluator concisely but abstractly. Surely unique decomposition, standardization, and hy- 
giene ensure the existence of a deterministic evaluator, but these properties do not spell out 
the details or implications. Based on a reasoned inspection of standard-order reduction, we 
expose its computational behavior and capture it in a novel abstract machine that has no 
store. The improvements to the initial machine produce a variant that effectively assimi- 
lates computational information, explicitly accounts for variable hygiene and thereby reveals 
coarse-grained operational structure implicit in call-by-need standard-order evaluation. 



5.2. Extensions. The machine presented above describes evaluation only for the pure 
lambda calculus. In this subsection, we introduce some features that are typical of prag- 
matic programming languages. 



5.2.1. Let binding. To help with a proof that the call- by-need calculus can simulate call- 
by-name, Ariola and Felleisen introduce a lei-based calculus. The let-calculus adds the 
construction let x = t in t to the set of terms and considers a new axiom: 

(Ax.ti) t2 = let x = t2 in ti 

The let-calculus is formulated by taking this axiom as a reduction rule running from left to 
right, and reformulating the original three axioms of the call-by- need calculus in terms of 
let expressions. These new axioms can be justified by proving that they are derivable from 
the original three axioms and the new let axiom. 

One approach to producing an abstract machine that supports let is to repeat the 
entire process described in this section, but focusing on the let-calculus instead of the 
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lambda calculus. However, let-binding can be retro-fitted to the current lambda calculus 
much more simply by reading the new let axiom as a right-to-left reduction rule: 

let X = t2 in ^1 — ^-need (Ax.ti) t2 

Essentially, this rule indicates that let bindings in the calculus can be understood as equiv- 
alent to an immediate application of an abstraction to a term. 

The consequences for the naive machine are the addition of a new kind of redex: 

r ::= . . . I let X = t2 in ti 

and a new refocus rule: 

{E, let X = t2 in ti) ^ i-)- {E, let x = t2 m ti)^ 

and a new reduce rule: 

{E, let x = t2 in ti)^ {E, (Xx.ti) t2) j 

Then, in the process of improving our machine, the reduce rule can be fast-forwarded to 
reduce the introduced abstraction and application immediately: 

{X \ E,let x = t2m ti)^ ^ {X,x'\Eo [(Ax'.D) t2],ti[x' /x]) ^ 

This process confirms that the let expression form can be comfortably treated as a 
conservative extension of the call-by-need lambda calculus. Its addition does not force a 
radical reconstruction of the abstract machine. As we show in Section [5.41 adding a circular 
letrec construct to the language fundamentally alters the system. 



5.2.2. Constants. Plotkin augments the lambda calculus with two sets of constants, 

the basic constants h and the function constants /. In that language, the basic constants 
are observables and serve as placeholders for real programming language constants like 
numbers, strings, and so on. The function constants serve as placeholders for real primitive 
functions, generally over basic constants. The function constants are not first-class values 
in the language, but instead appear as operators in primitive function expressions. 



t ::= .. 


■\b\ft 


V ::= .. 


■ 1 b 



Function expressions always take a single term argument. 

The calculus is parametrized on a partial function 5 that maps function-value pairs to 
values. As is standard, the notions of reduction for the calculus are augmented to handle 
function constant applications. 



f Vl ^need V2 H 5{f,Vi) =V2 



Since the notions of reduction for function constants require them to be applied to values, 
the calculus must account for answers. To handle this, the calculus can be extended with 
a rule to commute answer bindings with function expressions, as is done for application 
expressions: 



/ ((Ax. a) t) ^-need (Ax./ o) t 



If function expressions were only defined for basic constant arguments, then answer bind- 
ings could be discarded instead of commuted. However this approach imposes an ad hoc 
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limitation on the semantics. It would be cleaner and more orthogonal to uniform ly handle 
garbage collection with a specific notion of reduction ( Ariola and Felleisenl . 19971 ) . 

Function constants must be considered in the evaluation contexts. A function constant 
must force its argument in order to produce a value, so upon encountering a function 
constant application, evaluation must continue in its argument: 



E 



fE 



The other notions of reduction remain exactly the same, but since basic constants are 
values, they are subject to rules that manipulate values and answers. For example, if the set 
of basic constants includes numbers and the set of function constants includes operations 
on numbers, then by the deref rule: 

(Ax.addl x) 5 — ^-nced (Ax.addl 5) 5 

The abstract machine requires few changes to accommodate these additions. The set 
of redexes is extended to include function expressions: 

r ::=... I / a 

The refocus rules are generalized to create context frames for function constant applications 
and to rebuild when any value, lambda abstraction or basic constant, is encountered: 

{X\E,ft)f ^nara {X \ E O [f U],t) f 
iV) J I ^nam {X\E,v)fj 

Furthermore, rebuild must account for function constant applications: 

{X\Eo[fU]o Et, V)f ^nam {X\E,f lEf,, V^)^ 

Finally, the reduce rule for / a, pushes binder frames upwards and appeals to the delta rule 
S{f,v) for its result: 

{X\E,f lEb, V^)a ^nam {X \ E O E^, S{f, v)), 

Since 5{f, v) yields only values, the reduce configuration for / a can immediately transition 
to rebuild from this result. 



5.3. Lazy Constructors. As pointed out by Ariola and Felleisen (jl997l ). the call-by-need 
lambda calculus can be easily extended with support for lazy constructors. The rules for 
constructors and destructors can be inferred from the church encoding for pairs: 



cons 


= Xxi.Xx2.Xd.d xi X2 


car 


= Xp.p {Xxi.Xx2-Xi) 


cdr 


= Xp.p {Xxi.Xx2-X2) 



To add support for lazy pairs, we first extend the syntax of the language: 



t 




cons 1 1 car t cdr t {x, x) 


V 




{x,x) 


E 




car E cdr E 



The cons ti t2 expression creates a lazy pair of its two arguments, ti and t2- The car t 
and cdr t expressions evaluate their respective arguments and extract the first or second 
element of the resulting pair. As with function constants in the previous section, the lazy 
pair constructor and destructors are second-class. To express first-class constructors, these 
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may be eta-expanded into, e.g., Aa;i.Ax2.cons xi X2- The (xi,X2) value is a representation 
of a lazy pair. It contains variables that refer to shared computations. 

According to the evaluation contexts, evaluation may focus on the argument of a car 
or cdr expression. However, evaluation never directly operates on the arguments to a 
constructor. This is why the evaluation contexts for the language do not examine the 
arguments to cons. As a direct consequence, the standard call-by-need reduction rules will 
never substitute a value for a variable inside a pair. A variable inside a pair can only be 
evaluated after decomposing the pair using car or cdr. 

The Church encoding of pairs motivates the following notions of reduction: 



cons ti t2 


^need 


{XXI.{XX2. {XI,X2)) t2) h 


car {xi,X2) 


^need 


Xl 


cdr (xi,X2) 


^need 


X2 



These rules model cons as it is applied to two arguments. The car and cdr operations each 
expose a previously inaccessible variable reference to evaluation. 

To accommodate lazy pairs in the abstract machine, we extend the set of redexes: 

r ::= . . . [ cons ti t2 \ car a \ cdr a 

and we introduce several new rules. The first set of rules extends the refocus stage of the 
machine to handle our extensions. 



{X \ E, cons ti ^2)/ 
{X \ E , car t) J- 
{X \E,cdz t)f 



{X \ E, cons ti t2)d 
{X\Eo[ca.r D],t)f 
(X|£;o [cdr a],t)j^ 



A cons expression is immediately ready for reduction. The car and cdr expressions proceed 
to evaluate their respective arguments. Another set of rules returns an answer to car or 
cdr. 



{X\Eo[ca.r n]oEh,v)i, 
{X\Eo [cdz a]oEb,v)i, 



{X\E,ce.riE,,vW)^ 
{X\E,cdr \lEh,v]\), 



Finally, a set of rules corresponds to the notions of reduction. 



{X I E, cons ti t2)ci 
(X|£i,car lE2,{xi,X2)^)d 
(X I £2, cdr lE2,{xi,X2)fia 



{X, xi,X2\E o [(Axi.D) ti] o [(Aa;2.n) ^2], {xi,X2))h 

{X\EioE2,xi)^ 

{X\EioE2,X2)^ 



Lazy pair construction creates new binder frames for the two terms and produces a pair that 
references them. To evaluate a destructor, the binder frames associated with the answer 
are pushed upwards and the corresponding variable reference is extracted from the pair and 
the value of its associated computation is demanded. 



5.4. Circularity. As pointed out by Ariola et al. ( 19951 ). the presence of constructors 
makes it possible to observe duplicated constructors when the Y combinator is used to 
express recursion. 

Consider the expression: 

y(Ay.cons I y) = (A/.(Ax./ (x x)) (Ax./ (x x))) (Ay. cons 1 y) 
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Standard-order reduction in the calculus proceeds as follows, underlining either the 
active variable reference or the current redex: 

(A/.(Ax./ {x x)) (Ax./ (x x))) (Ay. cons 1 y) 

-^need (A/. (Ax. (Aj/. cons 1 y ) (x x)) (Ax./ (x x))) (Ay. cons 1 y) 

->need (A/. (Ax. (Ay . (Axi . (Ax2 . (xi,X2)) y) 1) (x x)) (Ax./ (x x))) (Ay. cons 1 y) 

Because of the two references to / in Y, the term (Ay. cons 1 y) is copied and ultimately 
recomputed each time the term is needed. Ideally a recursive (x, x) value could refer to 
itself and not recompute its value whenever its cdr is demanded. Duplicated computation 
does not corresp ond to how recursion and lazy constructors interact in typical semantics for 
lazy evaluation (jHenderson and Morrisl . Il97fil ). An implementation would create a single 
self-referencing cons cell. 

Because of the scoping rules for A, there is no way to explicitly define truly circular 
structures in this calculus. To address this, Ariola and Felle i sen introdu ce a /eirec-based 
calculus, inspired by the circular calculus of Ariola and Klop ( 19941 . 19971 ). The syntax of 
the letrec-calculus follows: 



t : 


:= X Ax.t \ tt\ letrec Dint 


D : 


:= xi be ti, . . . , Xn be 


V : 


:= Xx.t 


a : 


:= V letrec D in a 


E : 


:= n \ Et \ letrec D \n E 




letrec D,xbe E in E[x] 




letrec x„ be E, D[x, x„] in E[x] 


D[x,Xn] : 


:= X be E[xi],xi be -E'[x2], • • • , Xn-i be E[xn], D 



The letrec-calculus is similar to the let-calculus in how it adds an explicit binding form, but 
each letrec expression can contain an unordered set D of mutually recursive bindings x be t 
for distinct variables. 

The evaluation contexts for the letrec-calculus resemble those for the let-calculus. The 
contexts letrec D,x be E in E[x] and letrec x„ be E, D[x, x„] in E[x] represent evaluation tak- 
ing place in the binding position of a letrec expression. The first form expresses that a 
variable has been referenced in the body of the letrec and that variable's definition is cur- 
rently under evaluation. The second form expresses that some variables bound in the letrec 
form depend on each other, and a variable reference in the body of the letrec has forced 
evaluation of this chain of variable references. This chain of dependencies is denoted by the 
syntax Z)[x,x„]. The definition of the last referenced variable in the chain of dependencies, 
x„, is currently being evaluated. For evaluation to proceed, all the variables in a chain of 
dependencies must be disjoint. The letrec calculus regards a cyclic dependency chain as a 
diverging computation, a stuck expression that is not a valid answer. 
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The notions of reduction for the letrec calculus follow: 



(1) 


(Axil) t2 




^need 


letrec a; be i2 in ti 


(2) 


letrec D,xhev in E{x] 




^need 


letrec D,x be v, in E[v] 


(3) 


letrec .t„ be v, D[x, Xn] in 


E[x] 


^need 


letrec Xnbe v, D[x,v] in E[x] 


(4) 


(letrec D \n a) t 




^need 


letrec £) in a i 


(5) 


letrec Xn be (letrec D in a 


),D[x,Xn] 


^need 


letrec D, x„ be a, ^[x, x„] in i?[x] 




in E[x] 








(6) 


letrec Di,xbe (letrec D2 


in a) in E[x] 


^need 


letrec I?2, -Di , x be a in E[x] 



The rules operate as follows. Rule (1) is analogous to the equivalent let-calculus rule. 
Rule (2) is analogous to the basic dereference rule. Rule (3) resolves the last variable 
reference in a chain of dependency. The syntax Z)[x,f] expresses replacing x„_i be E[xn] 
with x„_i be E[v] in the list of bindings. Rules (4) through (6) are associativity rules that 
percolate bindings upward to ensure proper sharing. Rules (5) and (6) are important for 
recursion. They lift recursive bindings from the definition of a letrec-bound variable x and 
incorporate them into the same letrec expression that binds x. In essence they expand the 
set of variables bound by the outer letrec expression. 

Using the letrec-calculus, the corresponding example using cons is as follows: 

letrec y be cons 1 y\n y 

^nccd letrec y be (letrec xi be 1 in (letrec X2 be y in (xi, X2))) in y 

^nccd letrec xi be 1, y be (letrec X2 be y in (xi, X2)) in y 

->nccd letrec xi be 1, X2 be y, y be (xi, X2) in y 

-^need letrec xi be 1, X2 be y, y be (xi,X2) in (xi,X2) 

In this reduction, the recursive cons cell has been computed once and for all. No reference 
to the original cons lazy constructor remains. 

5.4.1. The letrec machine. Now we express the call-by-need letrec-calculus as an abstract 
machine. Development of this machine proceeds along the same lines as the non-circular 
machine. 

As described in the calculus, letrec bindings need not be ordered, but dependency chains 
have a natural order imposed by the order of dependencies. Furthermore, cyclic dependency 
chains are provable divergences in the letrec-calculus. In the face of such a cycle, a program 
is no longer subject to reduction. We can model this by letting the machine get "stuck". 

The basic terms for the machine remain close to those of the calculus: 

t ::= X I Xx.t \ tt\ letrec 1?+ in t 

D ::= Xi be ti 
V ::= Xx.t 

As with the calculus, D refers to sets of recursive bindings. As needed, we distinguish 
possibly empty sets of bindings, D*, from nonempty sets . This precision is needed to 
discuss machine behavior. The values of this machine are the lambda abstractions, but now 
answers are defined as values wrapped in zero or more tiers of letrec bindings. 

As we did for the prior machine, we introduce some representation changes that help 
with presenting the letrec-machine. The evaluation context is once again replaced with a 
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list of context frames. 



pXn 

XO 


::= {{xn, En) :: {xn-i,En-i) : 


{xq.Eq)} 


f 


::= at\lRD+\na 






\ IR X be D,D* mE 






lRxn+iben,P^^,D* \n E 




E 


::= [] 1 [/]o^|^o[/] 






where Eo[] = []oE = E 






and El o {E2 E^) = {Ei E2) 


0E3 



The operand frame [□ t] and binding frame [LR in □] are directly analogous to the cor- 
responding frames in the original abstract machine. The original cont frame, on the other 
hand, splits into two variants. The frame [LR x^+i be □, D* in ^'[xo]] captures chain of 
dependencies that is currently under evaluation. The expression D* stands for the inactive 
bindings in the frame, while the expression indicates a chain of dependencies. While 
no particular ordering is imposed on the bindings in D*, the chain of dependencies repre- 
sented by P^^ is ordered. Our machine representation separates dependencies from other 
bindings. In the calculus, D[xo,Xn] also includes any other bindings D. In the machine, 
P^^ only captures the dependencies, and the other bindings D* are explicitly indicated. 
Machine dependencies P^^ = {{xn,En) ■■ (x„_i,£'„_i) {xo,Eq)} correspond to cal- 

culus dependencies D[xq, Xn] = xbe E[xi],xi be -E'[x2], . . . , be -^[a;^]. We allow -P^o"^ 
to denote an empty chain. Observe that the order of the machine dependencies is reversed. 
This ordering expresses that dependencies are resolved in last-in first-out order. When the 
value of Xn+i is computed, its value will be used to compute the value of x„ and so on 
until the value of xq is computed and its value is returned to the body of the letrec expres- 
sion. Machine dependencies form a stack of computations that reference one another. This 
behavior is clarified in the behavior of the cyclic abstract machine. 

The frame [LR x be \I\,D* in E] closely resembles the old cont frame [(kx.E) □], except 
that the LR form may bind other variables as well. The absence of P^^ in this frame 
explicitly indicates that it has no chain of dependencies to be evaluated before substituting 
into the body of the LR . 

Machine answers are still binding- value pairs, but now each binding is a recursive bind- 
ing of multiple variables. 

a ::= \lE,v]\ where E = [LR in □] 

The set of redexes for the abstract machine follows. 

r ::= a t 

I lRxbea,D*\nE 

I LR Xn+i be a, PJ^" ,D*\nE 

The first redex form is the same as a form from the original machine. The second form is 
analogous to the {nx.E) a form from the original machine. The final redex form captures 
the case where one link in the chain of dependencies is about to be resolved. 

Figure [2] presents the transition rules of the cyclic abstract machine. The machine relies 
on an operator T{Eb), which given a list of binder frames [LR Df in □], flattens them into 

a single binder frame [LR Df in □]. This operation captures in aggregate the treatment of 
bindings by rules (5) and (6) of the calculus. 
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The first three refocus rules are the same as the lambda calculus, while the fourth rule 
is analogous to the equivalent rule for the let-calculus. The rebuild rules are also analogous 
to the lambda calculus, though now the binder frames have the form [LR in □]. 

The letrec-machine has two need rules, (A^-l) for a variable reference in the body of 
the corresponding letrec , and {N.2) for a variable reference that extends a (possible empty) 
chain of dependencies. The need rule does not consider variable bindings that are currently 
in a dependency chain, so evaluation will get stuck upon arriving at a cycle. 

There are four reduce rules. The rule, substitutes a value into the body of a 

letrec after its value has been computed. The {D.2) rule resolves the most recent reference 
in a chain of dependencies. Having computed the value of Xn+i, it returns to computing the 
value of Xn, which needed Xn+is value. If the chain of dependencies has only one element 
(i.e. -P^Q ), then the chain is fully resolved. The (-D.3) rule handles when an answer is 
applied to an expression. It combines calculus rules (4) and (1). Finally, the {DA) rule 
handles letrec occurrences in the source program. In order to address hygiene, this rule 
must simultaneously substitute for every bound variable in each binding as it focuses on 
the body of the letrec expression. 

The following is a machine trace of the cyclic cons example: 

(0 I [ ], letrec y be cons 1 y in y)y 

(0 I [ ], letrec y be cons 1 y in y)^ 
^ ({y}|[LRybeconslyinn],y)^ 
^ ({y}|[LRybeconslyinn],y)„ 
^ ({y} I [LRy be □ in []],cons 1 y)^ 
^ ({y} I [LRy be □ in []],cons 1 y)^ 

1-^ ({ y, xi, X2 } I [LR y be □ in [ ]] o [LR xi be 1 in □] o [LR X2 be y in □], {xi,X2))i, 
^ {{y,xi,x2 } I [ ], LR y be [[[LR xi be 1 in □] o [LR X2 be y in □], (xi,X2)JJ in [])^ 
1-^ ({ y, a;i, X2 } I [LR y be {xi,X2),xi be 1, X2 be y in □], {xi,X2))i, 
1-^ ({ y, xi, X2 } I [[[LR y be (xi, X2), xi be 1, X2 be y in □], (xi, X2)JJ) 



6. Simulating Call-by-need Using Control 

As we allude to above, call-by-need machine evaluation is highly suggestive of delimited 
control operations, but the connection is indirect and mixed with the other details of lazy 
evaluation. In this section, we directly interpret this connection in the terminology of 
delimited control. 

Based on the operational behavior of the abstract machine from Figure [H we derive a 
simulation of call-by-need execution under call-by-value augmented with deli mited control 
opera. tors. In particular, we translate call-by-need terms into the framework o vig et al.l 
(|2007l l. r^irst we overview the language of delimited control operations. Then we describe 
how the abstract machine performs delimited control operations. Next we present the 
simulation of call-by-need using delimited control. Finally we show its correctness. 



{X I E, r)^ (Reduce) 



(D.l) {X\Eu{y-Rx^e lE2,vlD* in£3))d 

{D.2) (X I El, (LRa;„+i be lE2,vlP-;^ ,D* in E^)) ^ 
{D2) {X\Ei,lE2,\x.ti\^t2)^ 
{DA) (x E, letrec Xi be U in t 



Kam {X\Ero [LR X be t;, 7-(^2), -D* in □] o Es,v), 

{X\EiO [LR Xn be □, Pxa ^ , Xn+1 

>„am {X,x'\EioE2o[LRx'bet2\nU]Mx'lx\)f x'iX 



X,x\ 



Eo^Rx\ be ti[x'Jxi\ in 0],^ 



(Refocus) 



(F.2) (X I i;, Aa;.t>^ 

(F.3) (X\EMt2)j 
{FA) {X I E, letrec 1?+ in t)^ 



{^\E.x)^ 
{X\E,Xx.t), 

{x\Eo[a t2],h)^. 

{X I E, letrec £>+ in t). 



{X\E,v)i, (Rebuild) 



(B.l) 

(i?.2) 

{B.3) 

(5.4) (X 
where = 



{X\Eio[D t]oEb,v)^ 
{X\Eio [(LR X be □. D* in ^2)] o -Bt, w)^ 
El o [(LR xn+ i be n,^^;,^* in ^2] o E;,,,^;);, 
[LR in □] 



{X\E,x)^ (Need) 



(A^.l) (X Eio[\_R{xbet,D*)\nU\oE\,x 

{N.2) {x\Eio [LR a;„ be 0,^x0""', {x be t,£)*) in E2] o Et,a;) 
where {x be t) ^ E'''' 



{X\lE,,v^) 

{X\EulE,,v^ t), 

(X I El, (LR X be lEb, i?* in Ea))^ 

(X |Ei,(LRx„+i be lE6,^;iP-;,^* in E2))^ 



(X|Eio[LRa;ben,Z)* inE2],i)^ 
{X\Eio [LR X be □, P^^J", E>* in E2], t)^ 



> 

H 

I — I 
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Figure 2: Letrec Machine 
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6.1. Delimited Control Operators. Dybvig et al. ( 200?! ) define a language with delim- 
ited control operators. We explain these operators using a simplified variant of the defining 
machine semantics. 



V 

E 



M 
P 



x \ V \ tt \ newPrompt \ pushPrompt 1 1 
withSubCont 1 1 \ pushSuhCont 1 1 
Xx.t I p I (M) 

□ I E[D t] I E[{Xx.t) □] I E[pushProinpt □ t] 
E[withSubCont □ t] \ E[withSubCont p □] 
ElpushSubCont □ t] 
[]\E: M\p: M 



The language extends the call-by-value untyped lambda calculus with the four operators 
newPrompt, pushPrompt, withSubCont , and pushSubCont as well as two new values: first- 
class prompts p, and first-class delimited continuations (M). Its control structure is defined 
using evaluation contexts E, and metacontexts M, which are lists that interleave prompts 
and contexts. Metacontexts use Haskell list notation. Prompts are modeled using natural 
numbers. 

A program state comprises an expression t, continuation E, metacontinuation M, and 
fresh prompt source p. The initial state for a program t is □[t], [ ], 0. 



E[{Xx.t) v],M,p 




E[t[v/x]],M,p 


E[newPrompt] , M, p 




E[p],M,p + l 


E[pushPrompt pi t] , M, p2 




U[t],pi : E : M,p2 


E[withSubCont pi Xx.t], Mi++{pi 


M2),P2 ^ 


a[t[{E:Mi)/x]],M2,P2 


where pi ^ Mi 






E[pushSubCont (Ml) t],M2,p 




D[t],Mi++{E : M2),p 


a[v],E : M,p 




E[v],M,p 


n[v],pi : M,p2 




D[v],M,p2 



The four operators manipulate delimited continuations, or subcontinuations, which are 
part of an execution context. The withSubCont operator takes a prompt and a function; it 
captures the smallest subcontinuation that is delimited by the prompt and passes it to the 
function. The non-captured part of the continuation becomes the new continuation. The 
prompt instance that delimited the captured subcontinuation is discarded: it appears in 
neither the captured su bcontinuati on nor the current continu ation. This operator general- 
izes J" (jFelleisenl . \l98^ ) and shift (jPanvv and Filinskil . [l990l l . 

The pushSubCont operator takes a subcontinuation and an expression; it composes the 
subcontinuation with the current continuation and proceeds to evaluate its second argument 
in the newly extended continuation. 

The pushPrompt operator takes a prompt and an expression; it extends the current 
continuation with the prompt and evaluates the expression in the newly extended con- 
tinuation. The newPrompt operator returns a distinguished fresh pr ompt each time it is 
called. These two operators generalize the delimiting operators # ( Felleisen . 19881 ) and 
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reset ( Danvy and Filinski . 1990l ). which extend a continuation with a single common de- 
Hmiter. 

To illustrate these operators in action, we consider a program that uses arithmetic and 
conditionals: 

let p = newPrompt 
in 2 + pushPrompt p 

if (withSubCont p 

{Xk.{pushSubCont k False)+ 
{pushSubCont k True))) 

then 3 
else 4 

A fresh prompt is bound to p and pushed onto the continuation just prior to evaluation 
of the if expression. withSubCont captures the subcontinuation [if □ then 3 else 4], which 
was delimited by p, and binds it to k. The subcontinuation k is pushed twice, given the 
value False the first time and True the second. The result of evaluation is the expression 
2 + 4 + 3 which yields 9. 

6.2. Delimited Control Naively Simulates the Machine. The call-by-need abstract 
machine performs two different kinds of partial control capture. To review, the rebuild and 
need rules of the abstract machine both capture some portion of the evaluation context. 
In particular, the rebuild rules capture binder frames. If only binder frames remain, then 
execution is complete. When either of the other frames is found, then a reduction is per- 
formed. On the other hand, the need rule captures the evaluation context up to the binder 
that matches the variable whose value is needed. 

These actions of the abstract machine can be recast in the language of delimited control 
capture. First, the need rule uses the identity of its variable, which must be an active 
variable, to delimit the context it captures. The well-formedness conditions from Section 1431 
guarantee that each binder frame binds a unique variable, so each active variable acts as 
a unique delimiter. Second, the rebuild rule uses the nearest non-binder frame to delimit 
the context it captures. This means that rebuild operates as though the operand frames, 
the cont frames, and the top of the evaluation context share a common delimiter. This 
guarantees that only binder frames are captured (as is stipulated in the rules). 

In short, call- by- need evaluation captures partial evaluation contexts. These partial 
evaluation contexts correspond to delimited continuations, and there are two different kinds 
of delimitation, redex-based (for rebuild) and binder-based (for need). 

It is useful to also consider how the machine manipulates these delimited continuations. 
Each reduce rule in Figure [1] immediately pushes the context associated with an answer onto 
the current evaluation context. In this manner, binders are consistently moved above the 
point of evaluation. The reduce rule then operates on the value part of the answer and the 
associated cont (for D.l) or term (for D.2). 

Although each reduce rule pushes binders onto the evaluation context, only the D.2 rule 
creates new binders. The variable bound by the answer's underlying lambda abstraction 
may already be a member of the set X, in which case it must be alpha-converted to a fresh 
name with respect to the set X. Also note that if Xx.t is alpha converted to Xx' .t[x' /x], the 
body under call-by- value satisfies the equation t[x' /x] = (Xx.t) x'. Since we are using the 
identifiers x' as delimiters, and we never turn the bin der frame [(Ax^□) t] back into a term, 
we can replace fresh variables x' with fresh prompts ( Balat et al.l . l20M ). 
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Let s be a distinguished identifier: 



MP 



runCC {let s = newPrompt in pushPrompt s AAJt]]) 



A/'|x| = need x 



do Va MM 

in let Xp = newPrompt 

in delay A/'|It2] as Xp in {va Xp) 



MlXx.tl = return Xx.Mlt\ 

return Va = withSubCont s Xka- {ka, Va) 

do X <S= ti in t2 = let {ka, x) = pushPrompt s ti 
in pushSubCont ka ^2 

delay ti as x in t2 = let fk = pushPrompt x t2 



From these observations, we construct the simulation in Figure [31 The simulation can 
be understood as a direct encoding of the abstract machine semantics for call-by-need. 
To execute a program, 7V^[[t]], the transformation uses runCC to initiate a control-based 
computation, acquires a fresh prompt, and binds it to a distinguished variable s. This 
prompt is the redex prompt, which is used to delimit every continuation that denotes a 
redex. 

To expose the conceptual structure of the simulation, we define five syntactic macros, 
do, return, delay, force, and need. We accord no formal properties to them: they merely 
simplify the presentation. The return macro captures the nearest subcontinuation that is 
delimited by the redex prompt s. Since the s delimiter appears before every reduction, 
the captured continuation is guaranteed to contain only code equivalent to binder frames. 
The translation returns a tuple containing the subcontinuation and the argument to return, 
which must be a value; the tuple represents an answer. So the translation rule for lambda 
abstractions, A/'[[Ax.t|, literally simulates the rebuild rules. 

The do macro executes a term ti under the current continuation extended with the 
redex prompt. If the term returns an answer {ka, ^) it immediately pushes the subcontinu- 
ation part and continues execution, binding the value part to the variable x. As such, the 
translation rule for applications, M^ti t2l, executes [[tij and binds the resulting operator to 



in fk XQ.ti 



force / = / 



need x = withSubCont x Xk. 

Xfth-do Va <= force /</,. 

in delay (return Va) as x 

in pushSubCont k (return Va) 



Figure 3: Translating CBN to CBV+Control 
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Va- The answer binders are pushed by the do macro, which starts the simulation of the D.2 
rule. 

The remainder of the D.2 rule is subtle. In the abstract machine, binder frame variables 
delimit the need rules. Since the delimited continuation framewor k relies on prompts to 



delimit continuations, fresh prompts literally substitute for variables (jKiselvov et al. 1.12006). 
The translation uses newPrompt to acquire a fresh prompt Xp and then uses the delay macro 
to simulate pushing a binder frame: the context delay t as x in □ is analogous to the binder 
frame [(Ax.D) t]. The delay macro anticipates that its body returns a function that 
expects the delayed argument, so it applies to a suspension of t. As we see below, the 
function is a cont {kx.E). 

In the context of delay, the simulation executes VaXp. Since alpha conversion of Xx.t can 
be written {Xxp.t[xp/x]), the term Va Xp is analogous to (Xx.t) Xp = t[xp/x]: it substitutes 
a fresh prompt for a fresh variable. 

The need macro, which defines the translation rule for variables, A/'|x]], captures the 
continuation delimited by x (which had better be a prompt!) and returns a function Xfth- ■ ■ ■ 
that closes over both x and the captured continuation k. This function is the cont kx.E, 
with X modeling the bound variable of the same name, and continuation k modeling E. 
The function expects the binder frame [(Ax.D) t], which is now at the top of the current 
continuation, to pass it the suspension A().A/'[[t]]. The simulation forces the suspension, and 
the do macro pushes the resulting answer binders and binds Va to the underlying value. 
Pushing the answer binders begins the simulation of the D.l rule. 

The simulation of D.l delays a computation that immediately returns the result Va of 
evaluating the term t, pushes the continuation k associated with the cont, and returns Va 
to the extended continuation. Now any subsequent evaluation of x immediately returns the 
memoized value Va instead of recomputing t. This yields an answer {ka,Va) where ka is an 
empty sub continuation. The value Va is delayed exactly as before and is also returned from 
the properly extended continuatio n. This part of t he tra nslation bears close resemblance 



to the paradoxical Y co mbinator (jCurry and Feyd . 19581 ). suggesting that the simulation 



requires recursive types (|Shanl . l2007l ;i. 



7. Correctness of the Simulation 

We prove correctness of the simulation relative to the machine semantics. Since we 
already proved correctness of the machine semantics relative to standard-order reduction, 
the result is a proof that our simulation provides a continuation semantics for call-by-need. 

The previous discussion provides an informal justification for the structure of the call- 
by-need simulation. To prove the corr ectness of the simul ation, we appeal to the contin- 
uation semantics for delimited control ( Dybvig et al.l . 200?! ). This semantics is completely 



standard for the terms of the lambda calculus. Figure [4] presents the interesting parts of 
the semantics. All CPS terms take a standard continuation but the control combinators 
also take a metacontinuation 7, which is a list of continuations and prompts, and a global 
prompt counter q. The base continuation kq delimits each proper continuation and directs 
evaluation up the metacontinuation, discarding any intervening prompts. Civen a CPS 
program t, the expression i kq [ ] runs it. 

To prove correctness, we compose M^-l with the delimited continuation semantics to 
produce a translation A[[-| to the A^^ calculus augmented with arithmetic, lists, and the 
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newPromptc — Xn.Xj.Xq.K q j {q + I) 
withSubContc ~ Xp.Xf.Xn.Xj.f (k : 7p kq 
pushPromptc = Xp.Xt.Xn.X'^.t Kq (p : k : 
pushSubContc — X'j' .Xt.XK.X'j.t kq (7'++(k : 7)) 
Kq — Xv.XjXq.lC{v,j,q) 

l^iv, [],q) 

K^{v,p : 7,g) = /C(w,7,g) 
IC{v, K : j,q) = K V ^ q 



Figure 4: Delimited Control Combinators 



operator /C defined in Figured! We also give each abstract machine configuration a denota- 
tion, defined in terms of name-indexed denotations for its constituents ^^H^ (see Figures [5] 
through E]). 



■'^Wx — ^H'-U'-i^tT ^ )/ 




Anti=m «o(0:[])l 








withSubContc x 




Xkx-Xki.ki 




Xfth-Xk2. 




pushPromptc (fth ()) 




{X{ka,Va). 




pushSubContc ka 




(Afc3. 




pushPromptc x 




{pushSubContc k^ 




{withSubContc Xka-Xk.k {ka, 


Va))) 


{Xfk-fk {X{). withSubContc 




Xka-Xk.k {ka, Va)) 




h)) 




k2) 




AJti ^2! = Xki. pushPromptc ApiJ 




{X{ka,Va). 




pushSubContc ka 




(Afc2. 




newPromptc 




Xxp. pushPromptc Xp {va 


Xp) 


{Xfk.fk (A().AIt2 


1) fc2)) 


h) 




AfXx.tj = WithSubContc Xka-Xk.k (fca, Ax.Ap|) 





Figure 5: Denotations for Terms 
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\X\ = \ [xi,X2,-.-,Xi,...,Xn]\ = n 

Alix I E, ?) J - vmx ^0 mmx^Ho -.[muxi + i) ce{ d, /, n } 

AUx I Ax.tDi = (Pii;]^, Ax.PM^) 

Figure 6: Denotations for Names and Configurations 

Denotations of machine configurations are constructed from their components: the 
configuration's focus ?, context E, and list of names X. A machine configuration denotes 
the translation of its focus applied to three arguments: the base continuation /to as its 
starting continuation, the denotation of its context, bounded by the redex delimiter 0, 
as the metacontinuation, and the size \X\ of X plus 1 as its initial prompt. The redex 
delimiter attached to the metacontinuation handles the case when an answer subsumes 
the entire context by returning the answer as the result. The denotation of the terminal 
machine configuration {X \ ^E, w]]) is treated separately to show how it corresponds directly 
to a final answer. 

Our semantic translation takes advantage of X being a proper list of unique names. Free 
active variables denote prompts in our translation, and since is the redex delimiter, we 
assign to each variable its 1-based index in X. We use |^| + 1 as the global prompt counter to 
ensure that no future prompts conflict with the current active variable denotations, thereby 
guaranteeing hygiene (see Section 14. 5p . 

Each evaluation context frame denotes a two-element metacontinuation consisting of a 
prompt and a proper continuation. The prompt for a binder frame is the prompt translation 
i{x, X) of the bound variable x. The cont and operand frames have redex prompts 0. These 
prompts guarantee that answer building operations will arrive at the innermost redex. Each 
continuation function specializes a subexpression of the CPS translation for terms ^^J-lj^^ 
with the denotations of the context frame's parts. Compare, for instance, the denotation 
of an application, ti t2, to that of an operand frame, [□ t2]. The application term pushes 
the global prompt, and executes ti in the context of a continuation that receives an answer 
{ka,Va)- The denotation of the operand frame is a metacontinuation containing the same 
prompt and continuation. 

A redex denotes a CPS'ed term that closes over the denotations of its constituents and 
implements the corresponding reduction step. 

To facilitate our proof of correctness, we make a slight change to the machine semantics. 
In the machine, composing an empty context with the current context is an identity oper- 
ation. The continuation semantics do not share this property. During execution, an empty 
continuation is denoted by the base continuation kq. If a continuation is captured or pushed 
in the context of an empty continuation, then the empty continuation will be captured as 
part of the metacontinuation or pushed onto the current metacontinuation before reinstat- 
ing the pushed continuation. In short, the call- by- need machine semantics guarantees that 
o [ ] = E, but the continuation semantics do not prove that : 7 = 7. Dybvig et al. 
discuss the notion of proper tail recursion for delimited continuations. Their operational 
characterization of proper tail recursion corresponds to the latter equation. 
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n[]ix = u 

WIx = «o:[] 

VIP t2]ix - : fc' : [ ] 
where k' ^ \ {ka, Va). 

pushSubContc ka 

(Afc2. 

newPromptc 

Xxp.pushPromptc Xp {va Xp) 

{Xfk-fk iXO-VMx) k2)) 

Ko 

VliiXxn) t2]i^^i{x,X):k' :[] 
where k' = Xfkjk (XO-V^x) «o 

Vlii^x.E) □]l^=0:fc': [] 
where k' = 

A {ka,Va)- 

pushSubContc ka 

pushPromptc l{x,X) 
{pushSubContc Pji?]]^ 

{withSubContc Xka-Xk.k {ka,Va))) 
(Xfk-fk {X{). WithSubContc Xka-Xk.k {ka,Va)) 

h)) 



Figure 7: Denotations for Evaluation Contexts 



To remove this mismatch, we add a ghost frame [^] to our definition of evaluation 
contexts. The ghost frame denotes the metacontinuation : [ ]. We also extend the unplug 
operation on evaluation contexts such that it discards ghost frames: ClE o [:^]]] = C|{£'|. 
Finally, we alter the right hand side of transition rules that grab and push continuations 
to pair ghost frames with composed evaluation contexts in a manner consistent with the 
continuation semantics. For instance, the updated D.l rule is as followfS: 

{X\Ei,{kX.E2) m,vW)^^nam {X \ Ei O [#] o E; o [{Xx-D) v] o [#] o E2, v) , (D.l) 

These modifications do not alter the observable behavior of the machine while modeling the 
property that pushing empty frames has meaning in the continuation semantics. 

Given these denotations, it is straightforward to prove correctness of the simulation 
relative to the abstract machine. 

Theorem 7.1. If t is a dosed term, then A^[t| = A[[(0 | [ ], 

Proof A^ltj = (0 : [ ]) 1 = AI(0 | [ ], t)^]. □ 

The exact placement of ghost frames fahs right out of the correctness proof. 
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P UjOl to UiUK^U lib(^ [[-/-/2 Jl 








piLofli 1 (Jliipic t.\^Xl,yv J 




ypuoito ilukjU file li^ijl 




{withSubContc Xka-Xk 


k{ka,Xx2-Mx))) 


{Xfk-fk {XQ. withSubContc 


0Xka.Xk.k{ka,Xx2Mx)) 


fcs)) 








pushSubContc TilEl^ 




{\k2. 




newPromptc 




Xxp.pushPromptc Xp ((A 









Figure 8: Denotations for Redexes 



Theorem 7.2. //Ci ^nam C2 then AlCij = AIC2}. 

Proof. By cases on 1 — ^nam ■ The proof utihzes beta, eta and /C equivalences to estabhsh 
correspondences. □ 



8. Simulating Extensions 

Many straightforward language extensions also have straightforward simulations under 
the above model. 

Simulating let bindings essentially performs the same operations as immediately apply- 
ing a lambda abstraction. 

Afpet X = ti in t2j = let x = newPrompt 
in delay 7V|ti| as x 
in MM 

In contrast to the application rule, the variable x is directly assigned a fresh prompt, rather 
than binding it to an auxiliary variable Xp. The body of the let can be interpreted in place 
and substitution of the prompt is implicit since x is already free in t2- 

The translation for basic constants is analogous to that for lambda abstractions: the 
constant must be returned to the next redex. 

ATJcI = return c 

For this translation, we assume that the call-by-value language provides the same constants 
as the call-by-need language. 

The translation for function constants is as follows: 

Aflf tj= do Va^Afm 
in return (/ Va) 

Interpreting a function constant application forces its argument and then acts on the value 
that is ultimately produced. Since function expressions yield values, the result is immedi- 
ately returned. 



34 



R. GARCIA, A. LUMSDAINE, AND A. SABRY 



The translation for cons acquires two fresh prompts, uses them to delay the argument 
to cons, and stores them as a pair. 

A/'[[cons ti = let xi = newPrompt in 
let X2 = newPrompt in 
delay AAKtil as xi in 
delay A/'[[t2l as X2 in 
cons xi X2 

The translations for car and cdr evaluate their argument, retrieve a prompt from the 
resulting pair, and demand its value. 

AAJcar tj =doVp^ Mftj 
in need (car Vp) 

A/'[[cdr tj = do Up ^ Mltj 
in need (cdr Vp) 



9. Conclusions 



In this paper, we expose and examine the operational structure of lazy evaluation as 
embodied in call-by-need semantics. We present this understanding in two ways: as an 
abstract machine whose operational behavior involves control capture, and as a simulation 
of call- by- need under call- by- value plus delimited control operations. Delimited control can 
be used to simulate a global heap, but our particular simulation uses delimited control 
operations to manage laziness locally, just like the calculus reduction rules. 

The artifacts of this investigation provide new tools for increasing our understanding 
of lazy evaluation and its connections to control. The abstract machine could be used to 
establish connections to heap-based implementations of call-by-need , and possibly modern 
graph-reduction based formulations (j Pevton Jones and Salkild . 1989^). In fact it seems that 
the calculus and abstract machine may point out new structural and dynamic invariants that 
are inherent to call-by-need evaluation but are hidden in the unstructured representations 
of heaps. 

The abstract machine and simulation might also provide new opportunities for reason- 
ing about the correctness of transformations applied to call-by-need programs. Surely the 
calculus provides the same equational reasoning powers as the abstract machine. However 
the machine may enable researchers to more easily conceive transformations and justifica- 
tions that are not as easy to recogn i ze in t he reduction semantics. Our simulation might be 



connected to that of Okasaki et al 



The simulation might suggest new mechanisms 
by which to embed call- by-need evaluation within call- by- value programs. 

One significant differ e nce between the tw o forin ulations of call-by-need lambda cal- 
culi ( Maraist et al. . 19981 : Ariola and Felleisen . 1997 ) is the status of variables. Maraist 
et al. consider vari ables to be values, whereas Ariola and Felleisen do not. Ultimately, 
Maraist et al. (Il998l ^ prove standardization against a standard-order relation that does not 
consider variables to be values. This paper sheds no light on the inclusion of variables 
among the values, however it demonstrates in stark detail the consequences of the latter 
design. In the abstract machine, the transition rules for lambda terms, namely the rebuild 
rules, differ significantly from the transition rules for variables, the need rules. A similar 
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distinction can be seen simply by observing the complexity of their respective translations. 
In short, our semantics interpret variables as memoized computations rather than values. 
Variables can be treated as values under deterministic call-by-valuc and call-by-name re- 
duction; it remains an open question whether the same could be achieved for call-by-need 
and if so what its operational implications would be. 

Our results reveal that a proliferation of semantic frameworks — reduction semantics, 
machine semantics, etc is a boon and not a crisis. The reduction semantics of call-by-need 
elegantly and mysteriously encode a rich semantics whose broad implications can be seen in 
equivalent machine semantics and continuation semantics. As such, our work provides new 
perspectives from which to reason about call-by-need, delimited control, and their respective 
expressive powers. 
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